The mean and variance of phylogenetic diversity under rarefaction.

نویسندگان

  • David A Nipperess
  • Frederick A Matsen
چکیده

Phylogenetic diversity (PD) depends on sampling depth, which complicates the comparison of PD between samples of different depth. One approach to dealing with differing sample depth for a given diversity statistic is to rarefy, which means to take a random subset of a given size of the original sample. Exact analytical formulae for the mean and variance of species richness under rarefaction have existed for some time but no such solution exists for PD.We have derived exact formulae for the mean and variance of PD under rarefaction. We confirm that these formulae are correct by comparing exact solution mean and variance to that calculated by repeated random (Monte Carlo) subsampling of a dataset of stem counts of woody shrubs of Toohey Forest, Queensland, Australia. We also demonstrate the application of the method using two examples: identifying hotspots of mammalian diversity in Australasian ecoregions, and characterising the human vaginal microbiome.There is a very high degree of correspondence between the analytical and random subsampling methods for calculating mean and variance of PD under rarefaction, although the Monte Carlo method requires a large number of random draws to converge on the exact solution for the variance.Rarefaction of mammalian PD of ecoregions in Australasia to a common standard of 25 species reveals very different rank orderings of ecoregions, indicating quite different hotspots of diversity than those obtained for unrarefied PD. The application of these methods to the vaginal microbiome shows that a classical score used to quantify bacterial vaginosis is correlated with the shape of the rarefaction curve.The analytical formulae for the mean and variance of PD under rarefaction are both exact and more efficient than repeated subsampling. Rarefaction of PD allows for many applications where comparisons of samples of different depth is required.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of genetic diversity, phylogenetic relationships and population structure of Arasbaran cornelian cherry (Cornus mas L.) genotypes using ISSR molecular markers

Cornelian cherry (Cornus mas L.), considered as the ancestor of cultivated trees in Arasbaran region, is a medicinally and economically plant species. However, little is known about genetic diversity, breeding programs, and population structure of this species in mentioned region. Keeping this in view, the main objectives of present study were to analysis the genetic diversity, phyloge...

متن کامل

Abundance-weighted phylogenetic diversity measures distinguish microbial community states and are robust to sampling depth

In microbial ecology studies, the most commonly used ways of investigating alpha (within-sample) diversity are either to apply non-phylogenetic measures such as Simpson's index to Operational Taxonomic Unit (OTU) groupings, or to use classical phylogenetic diversity (PD), which is not abundance-weighted. Although alpha diversity measures that use abundance information in a phylogenetic framewor...

متن کامل

Mitochondrial Diversity and Phylogenetic Structure of Marghoz Goat Population

The genetic diversity and phylogenetic structure was analyzed in Marghoz goat population by mitochondrial DNA sequences. Phylogenetic analysis was carried out using hyper variable region 1 (968 bp) obtained form 40 animals. Marghoz goat proved to be extremely diverse (average haplotype diversity of 0.999) and the nucleotide diversity values 0.022. A total of 40 Marghoz goats were grouped into s...

متن کامل

Mean and Variance of Phylogenetic Trees

Abstract.— We describe the use of the Fréchet mean and variance in the Billera-Holmes-Vogtmann (BHV) treespace to summarize and explore the diversity of a set of phylogenetic trees. We show that the Fréchet mean is comparable to other summary methods, despite its stickiness property, and that the Fréchet variance is faster and more precise than commonly used variance measures. These mean and va...

متن کامل

Investigation of General and Specific Combining Ability and Genetic Analysis of Different Traits of Bread Wheat under Non-Stress and Drought Stress Conditions

The production of new and compatible cultivars to different environments is one of the most important goals for the breeders. The crossing new cultivars and the selection of superior genotypes for desirable traits among their offspring is a method that has always been used by breeders. 28 genotypes obtained from the crossing of a 7 × 7 one-way diallel experiment consisting of seven parents (Alv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Methods in ecology and evolution

دوره 4 6  شماره 

صفحات  -

تاریخ انتشار 2013